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TITLE 

Bifiinctional Selectable Fusion Genes 

5 

ffft^nPOTTlsm OF T«P TWFNTTON 

The present invention relates generally to genes expressing selectable 
phenotypes. More particularly, the present invention relates to genes capable of co- 
expressing both dominant positive selectable and negative selectable phenotypes. 

10 Genes which express a selectable phenotype are widely used in recombinant 

DNA technology as a means for identifying and isolating host cells into which the gene 
has been introduced. Typically, the gene expressing the selectable phenotype is 
introduced into the host cell as part of a recombinant expression vector. Positive 
selectable genes provide a means to identify and/or isolate cells that have retained 

15 introduced genes in a stable form, and, in this capacity, have gready facilitated gene 
transfer and the analysis of gene function. Negative selectable genes, on the other 
hand, provide a means for eliminating cells mat retain the introduced gene. 

A variety of genes are available which confer selectable phenotypes on animal 
cells. The bacterial neomycin phosphotransferase (neo) (Colbere-Garapin et al., J. 

20 Mol. Biol. 150:1, 1981), hygromycin phosphotransferase (hph) (Santerre et al., Gene 
50:147, 1984), and xanthine-guanine phosphoribosyl transferase (gpt) (Mulligan and 
Berg, Proc. Natl. Acad. Sci. USA 78:2072, 1981) genes are widely used dominant 
positive selectable genes. The Herpes simplex virus type I mymidine kinase (HSV-I 
TK) gene (Wigler et al., Cell 11:223, 1977), and the cellular adenine 

25 phosphoribosyltransferase (APRT) (Wigler et al., Proc. Natl. Acad. Sci. USA 
7d:1373, 1979) and hypoxanthine phosphoribosyltransferase (HPRT) genes (Jolly et 
al., Proc. Natl. Acad. Sci. USA 80:477, 1983) are commonly used recessive positive 
selectable genes. In general, dominant selectable genes are more versatile than 
recessive genes, because the use of recessive genes is limited to mutant cells deficient in 

30 the selectable function, whereas dominant genes may be used in wild-type cells. 

Several genes confer negative as well as positive selectable phenotypes, 
including the HSV-I TK, HPRT, APRT and gpt genes. These genes encode enzymes 
which catalyze the conversion of nucleoside or purine analogs to cytotoxic 
intermediates. The nucleoside analog GCV is an efficient substrate for HSV-I TK, but 

35 a poor substrate for cellular TK and therefore may be used for negative selection 
against the HSV-I TK gene in wild-type cells (St Clair et al., Antimicrob. Agents 
Chemother. 5/:844, 1987). However, the HSV-I TK gene may only be used 
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effectively for positive selection in mutant cells lacking cellular TK activity. Use of the 
HPRT and APRT genes for either positive or negative selection is similarly limited to 
HPRT- or APRT- cells, respectively (Fenwick, "The HGPRT System", pp. 333-373, 
M. Gottesman (ed), Molecular Cell Genetics, John Wiley and Sons, New York, 1985; 
5 Taylor et aL, 'The APRT System", pp., 31 1-332, M Gottesman (ed.), Molecular Cell 
Genetics, John Wiley and Sons, New York, 1985). The gpt gene, on the other hand, 
may be used for both positive and negative selection in wild-type cells. Negative 
selection against the gpt gene in wild-type cells is possible using 6-thioxanthine, which 
is efficiently converted to a cytotoxic nucleotide analog by the bacterial gpt enzyme, but 

10 not by the cellular HPRT enzyme (Besnaid et aL, Mol Cell Biol 7:4139, 1987). 

More recently, attention has turned to selectable genes that may be incorporated 
into gene transfer vectors designed for use in human gene therapy. Gene therapy is a 
method for permanently curing a hereditary genetic disease which results from a defect 
in or absence of one or more genes. Collectively, such diseases result in significant 

15 morbidity and mortality. Examples of such genetic diseases include hemophilias A and 
B (caused by a deficiency of blood coagulation factors VIQ and DC, respectively), 
alpha-l-antitiypsin deficiency, and adenosine deaminase deficiency. In each of these 
particular cases, the missing gene has been identified and its complementary DNA 
(cDNA) molecularly cloned (Wood et aL, Nature 312:330, 1984; Anson et aL, Nature 

20 315 :683, 1984; and Long et aL, Biochemistry 25:4828, 1984; Daddona et aL, 7. Biol 
Chem. 259:12101, 1984). While palliative therapy is available for some of these 
genetic diseases, often in the form of administration of blood products or blood 
transfusions, one way of permanently curing such genetic diseases is to introduce a 
replacement for the defective or missing gene back into the somatic cells of the patient, 

25 a process referred to as "gene therapy" (Anderson, Science 226:401, 1984). Gene 
therapy can also be used as a means for augmenting normal gene function, for example, 
by introducing a heterologous gene capable of modifying cellular activities or cellular 
phenotype, or alternatively, expressing a drug needed to treat a disease. 

The process of gene therapy typically involves the steps of (1) removing 

30 somatic (non-germ) cells from the patient, (2) introducing into the cells ex vivo a 
replacement gene via an appropriate vector capable of expressing the replacement gene, 
and (3) transplanting or transfusing these cells back into the patient, where the 
replacement gene is expressed to provide some therapeutic benefit Gene transfer into 
somatic cells for human gene therapy is presently achieved ex vivo (Kasid et aL, Proc. 

35 Natl Acad. Sci. USA 87:473, 1990; Rosenberg et aL, N. Engl. J. Med. 525:570, 
1990), and this relatively inefficient process would be facilitated by the use of a 
dominant positive selectable gene for identifying and isolating those cells into which the 
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replacement gene has been introduced before they are returned to the patient The neo 
gene, for example, has been used to identify genetically modified cells used in human 
gene therapy. 

In some instances, however, it is possible that the introduction of genetically 
5 modified cells may actually compromise the health of the patient The ability to 
selectively eliminate genetically modified cells in vivo would provide an additional 
margin of safety for patients undergoing gene therapy, by permitting reversal of the 
procedure. This might be accomplished by incorporating into the vector a negative 
selectable (or 'suicide') gene that is capable of functioning in wild-type cells. 

10 Incorporation of a gene capable of conferring both dominant positive and negative 
selectable phenotpyes would ensure co-expression and co-regulation of the positive and 
negative selectable phenotypes, and would minimize the size of the vector. However, 
positive selection for the gpt gene in some instances requires precise selection 
conditions which may be difficult to determine. Moreover, the feasibility of using the 

15 gpt gene for in vivo negative selection has not yet been clearly established. For these 
reasons, co-expression of a dominant positive selectable phenotype and a negative 
selectable phenotype is typically achieved by co-expressing two different genes which 
separately encode other dominant positive and negative selectable functions, rather than 
using the gpt gene. 

20 The existing strategies for co-expressing dominant positive and negative 

selectable phenotypes encoded by different genes often present complex challenges. As 
indicated above, the most widely used technique is to co-transfect two plasmids 
separately encoding two phenotypes (Wigler et al., Cell 16:777, 1979). However, the 
efficiency of co-transfer is rarely 100%, and the two genes may be subject to 

25 independent genetic or epigenetic regulation. A second strategy is to link the two genes 
on a single plasmid, or to place two independent transcription units into a viral vector. 
This method also suffers from the disadvantage that the genes may be independently 
regulated. In retroviral vectors, suppression of one or the other independent 
transcription unit may occur (Emerman and Temin, Mol. Cell Biol 6:792, 1986). In 

30 addition, in some circumstances there may be insufficient space to accommodate two 
functional transcription units within a viral vector, although retroviral vectors with 
functional multiple promoters have been successfully made (Overell et al., Mol Cell 
Biol &1803, 1988). A third strategy is to express the two genes as a bicistronic 
mRNA using a single promoter. With this method, however, the distal open reading 

35 frame is often translated with variable (and usually reduced) efficiency (Kaufman et al., 
EMBO /. 6:187, 1987), and it is unclear how effective such an expression strategy 
would be in primary cells. 
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The present invention provides a method for more efficiently and reliably co- 
expressing a dominant positive selectable phenotype and a negative selectable 
phenotype encoded by different genes. 

5 SUMMARI OF THE 1NYENTTQN 

The present invention provides a selectable fusion gene comprising a dominant 
positive selectable gene fused to and in reading frame with a negative selectable gene. 
The selectable fusion gene encodes a single bifunctional fusion protein which is capable 
of conferring a dominant positive selectable phenotype and a negative selectable 

10 phenotype on a cellular host In a preferred embodiment, the selectable fusion gene 
comprises nucleotide sequences from the hph gene fused to nucleotide sequences from 
the HSV-I TK gene, referred to herein as the HyTK selectable fusion gene (Sequence 
Listing No. 1). The HyTK selectable fusion gene confers both hygromycin B 
resistance (Hrrf) for dominant positive selection and ganciclovir sensitivity (GCV 8 ) for 

15 negative selection. 

The present invention also provides recombinant expression vectors, for 
example, retroviruses, which include the selectable fusion genes, and cells transduced 
with the recombinant expression vectors. 

The selectable fusion genes of the present invention are expressed and 

20 regulated as a single genetic entity, permitting co-regulation and co-expression with a 
high degree of efficiency. 

BRIEF DESCRIPTION OF THE 

Figure 1 shows diagrams of the plasmids tgCMV/hygro, tgCMV/TK and 
25 tgCMV/HyTK which contain proviral structures used in the present invention. The 
three plasmids are identical, except for the genes inserted between the HCMV promoter 
(filled box) and the SV40 early region polyadenylation signal (hatched box). 

Figure 2 shows diagrams of the proviral structures from the plasmids shown in 
Figure 1. The horizontal arrows indicate transcriptional start sites and direction of 
30 transcription. The open box labeled LTR is the retroviral long terminal repeat The 
viral splice donor is labeled SD and the acceptor sequences are labeled SA. The open 
box labeled CMV is the cytomegalovirus promoter. In tgLS(+)HyTK/stop, the 
positions of the two internal initiation codons retained in the HyTK selectable fusion 
gene are indicated by vertical arrows. The location at which the universal translation 
35 terminator oligonucleotide was inserted is also marked 

Figures 3 and 4 are graphs showing the results of a short-term proliferation 
assay in which the hygromycin resistant (HmF) NIH/3T3 cell pools and Hrrf and HAT 
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resistant (HAT 1 ) Rat-2 cell pools were tested for ganciclovir sensitivity (GCV S ). Figure 
3 shows that GCV inhibits growth of NIH/3T3 cells transfected with tgCMV/HyTK, 
but does not inhibit growth of NIH/3T3 cells transfected with tgCMV/Hygro. Figure 4 
shows that GCV inhibits growth of Rat-2 cells transfected with tgCMV/HyTK (initially 
5 selected for Hm 1 or HAT 1 ) even at the lowest concentrations of GCV, and also inhibits 
growth of Rat-2 ceUs transfected with tgCMV/TK, although at slightly higher 
concentrations. GCV did not inhibit growth of Rat-2 cells transfected with 
tgCMV/Hygro. 

Figure 5 shows the results of Northern analysis of Hm r and HAT cell pools. 

10 Polyadenylated mRNA was extracted from each Hrtf and HAT 1 cell pool, and used to 
prepare Northern blots which were probed with sequences from the hph gene (Panel 
A), the HSV-I TK gene (Panel B), or the B-actin gene (Panel C) (for mRNA 
equivalence). The positions of the 28S and 18S ribosomal RNAs are indicated. The 
mRNA present in each lane was extracted from the following cells: Lane 1, Rat-2 cells 

15 transfected with tgCMV/hygro; Lane 2, Rat-2 cells transfected with tgCMV/TK; Lane 
3, Rat-2 cells transfected with tgCMV/HyTK and selected for Hm r ; Lane 4, Rat-2 cells 
transfected with tgCMV/HyTK and selected for HAT r ; Lane 5, NIH/3T3 cells 
transfected with tgCMV/hygro; Lane 6, NIH/3T3 cells transfected with tgCMV/HyTK. 
Figure 6 shows photographs of stained colonies of uninfected NIH/3T3 cells 

20 (plates a, b and c) and NIH/3T3 cells infected with the tgLS(+)HyTK (plates d and e) 
or tgLS(-) CMV/HyTK (plates f and g) retroviruses. The cells were grown in medium 
alone or medium supplemented with GCV, Hm or Hm plus GCV in a long-term 
proliferation assay. The data show that uninfected NIH/3T3 cells were resistant to 
GCV and grew to confluence (plate b), but were killed by Hm (plate c). Growth of 

25 NIH/3T3 cells infected with the tgLS(+)HyTK and tgLS(-) CMV/HyTK retroviruses 
and grown in the presence of Hm (plates d and f) was inhibited by GCV (plates e and 
g). 



OETATLED DESCRIPTION OF THF. INVENTION 

30 

SEQ ID NO:l and SEQ ID NO:2 (appearing immediately prior to the claims) 
show specific embodiments of the nucleotide sequence and corresponding amino acid 
sequence of the HyTK selectable fusion gene of the present invention. The HyTK 
selectable fusion gene shown in the Sequence Listing comprises sequences from the 
35 hph gene (nucleotides 1-971) linked to sequences from the HSV-I TK gene 
(nucleotides 972-2076). 
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Definitions 

As used herein, the tenn "selectable fusion gene" refers to a nucleotide sequence 
comprising a dominant positive selectable gene which is fused to and in reading frame 
with a negative selectable gene and which encodes a single bifiinctional fusion protein 
5 which is capable of conferring a dominant positive selectable phenotype and a negative 
selectable phenotype on a cellular host A "dominant positive selectable gene" refers to 
a sequence of nucleotides which encodes a protein conferring a dominant positive 
selectable phenotype on a cellular host, and is discussed and exemplified in further 
detail below. A "negative selectable gene" refers to a sequence of nucleotides which 

10 encodes a protein conferring a negative selectable phenotype on a cellular host, and is 
also discussed and exemplified in further detail below, A "selectable gene" refers 
generically to dominant positive selectable genes and negative selectable genes. 

A selectable gene is "fused to and in reading frame with" another selectable gene 
if the expression products of the selectable genes (i.e., the proteins encoded by the 

15 selectable genes) are fused by a peptide bond and at least part of the biological activity 
of each of the two proteins is retained. With reference to the HyTK selectable fusion 
gene disclosed herein, for example, the hph gene (encoding hygromycin-B 
phosphotransferase, which confers the dominant positive selectable phenotype of 
hygromycin resistance (Hm 1 )) is fused to and in reading frame with the HSV-I TK gene 

20 (encoding Herpes Simplex Virus Type I thymidine kinase, which confers a negative 
selectable phenotype of ganciclovir sensitivity, or (GCV S )) if ihe hph and HSV-I TK 
proteins are fused by a peptide bond and expressed as a single Afunctional fusion 
protein. The component selectable gene sequences of the present invention are 
preferably contiguous; however, it is possible to construct selectable fusion genes in 

25 which the component selectable gene sequences are separated by internal nontranslated 
nucleotide sequences, such as introns. For purposes of the present invention, such 
noncontiguous selectable gene sequences are considered to be fused, provided that 
expression of the selectable fusion gene results in a single bifunctional fusion protein in 
which the expression products of the component selectable gene sequences are fused by 

30 a peptide bond. 

"Nucleotide sequence" refers to a heteropolymer of deoxyribonucleotides or 
ribonucleotides, such as a DNA or RNA sequence. Nucleotide sequences may be in 
the form of a separate fragment or as a component of a larger construct Preferably, the 
nucleotide sequences are in a quantity or concentration enabling identification, 

35 manipulation, and recovery of the sequence by standard biochemical methods, for 
example, using a cloning vector. Recombinant nucleotide sequences are the product of 
various combinations of cloning, restriction, and ligation steps resulting in a construct 
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having a structural coding sequence distinguishable from homologous sequences found 
in natural systems. Generally, nucleotide sequences encoding the structural coding 
sequence, for example, the selectable fusion genes of the present invention, can be 
assembled from nucleotide fragments and short oligonucleotide linkers, or from a series 
5 of oligonucleotides, to provide a synthetic gene which is capable of being expressed in 
a recombinant transcriptional unit Such sequences are preferably provided in the form 
of an open reading frame uninterrupted by internal nontranslated sequences, or introns, 
which are typically present in eukaryotic genes. Genomic DNA containing the relevant 
selectable gene sequences is preferably used to obtain appropriate nucleotide sequences 

10 encoding selectable genes; however, cDNA fragments may also be used Sequences of 
non-translated DNA may be present 5* or 3' from the open reading frame or within the 
open reading frame, provided such sequences do not interfere with manipulation or 
expression of the coding regions. Some genes, however, may include introns which 
are necessary for proper expression in certain hosts, for example, the HPRT selectable 

15 gene includes introns which are necessary for expression in embryonal stem (ES) cells. 
As suggested above, the nucleotide sequences of the present invention may also 
comprise RNA sequences, for example, where the nucleotide sequences are packaged 
as RNA in a retrovirus for infecting a cellular host The use of retroviral expression 
vectors is discussed in greater detail below. 

20 The term "recombinant expression vector" refers to a replicable unit of DNA or 

RNA in a form which is capable of being transduced into a target cell by transfection or 
viral infection, and which codes for fee expression of a selectable fusion gene which is 
transcribed into mRNA and translated into protein under the control of a genetic element 
or elements having a regulatory role in gene expression, such as transcription and 

25 translation initiation and termination sequences. The recombinant expression vectors of 
the present invention can take the form of DNA constructs replicated in bacterial cells 
and transfected into target cells directly, for example, by calcium phosphate 
precipitation, electroporation or other physical transfer methods. The recombinant 
expression vectors which take the form of RNA constructs may, for example, be in the 

30 form of infectious retroviruses packaged by suitable "packaging" cell lines which have 
previously been transfected with a proviral DNA vector and produce a retrovirus 
containing an RNA transcript of the proviral DNA. A host cell is infected with the 
retrovirus, and the retroviral RNA is replicated by reverse transcription into a double- 
stranded DNA intermediate which is stably integrated into chromosomal DNA of the 

35 host cell to form a provirus. The provirus DNA is then expressed in the host cell to 
produce polypeptides encoded by the DNA. The recombinant expression vectors of the 
present invention thus include not only RNA constructs present in the infectious 
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retrovirus, but also copies of proviral DNA, which include DNA reverse transcripts of 
a retrovirus RNA genome stably integrated into chromosomal DNA in a suitable host 
cell, or cloned copies thereof, or cloned copies of unintegrated intermediate forms of 
retroviral DNA. Proviral DNA includes transcriptional elements in independent 
5 operative association with selected structural DNA sequences which arc transcribed into 
mRNA and translated into protein when proviral sequences are expressed in infected 
host cells. Recombinant expression vectors used for direct transfection will include 
DNA sequences enabling replication of the vector in bacterial host cells. Various 
recombinant expression vectors suitable for use in the present invention are described 
10 below. 

'Transduce 1 1 means introduction of a recombinant expression vector containing 
a selectable fusion gene into a cell. Transduction methods may be physical in nature 
(i.e., transfection), or they may rely on the use of recombinant viral vectors, such as 
retroviruses, encoding DNA which can be transcribed to RNA, packaged into 

15 infectious viral particles and used to infect target cells and thereby deliver the desired 
genetic material (i.e., infection). Many different types of mammalian gene transfer and 
recombinant expression vectors have been developed (see, e.g., Miller and Calos, eds., 
"Gene Transfer Vectors for Mammalian Cells," Current Comm. MoL Biol, (Cold 
Spring Harbor Laboratory, New York, 1987)). Naked DNA can be physically 

20 introduced into mammalian cells by transfection using any one of a number of 
techniques including, but not limited to, calcium phosphate transfection (Berman et al., 
Proc. Natl. Acad. Sci. USA 84 81: 7176, 1984), DEAE-Dextran transfection 
(McCutchan et al., /. Natl Cancer Inst. 41:351, 1986; Luthman et al., Nucl. Acids 
Res. 11: 1295, 1983), protoplast fusion (Deans et al., Proc. Natl. Acad. Sci. USA 84 

25 81: 1292, 1984), electroporation (Potter et al., Proc. Natl Acad. Sci. USA 84 81: 
7161, 1984), lipofection (Feigner et al., Proc. Natl. Acad. Sci. USA S*7413, 1987), 
polybrene transfection (Kawai and Nishzawa, Mol Cell. Biol 4:1172, 1984) and direct 
gene transfer by laser micropuncture of cell membranes (Tao et aL, Proc. Natl. Acad. 
Sci. USA 84 84:4180, 1987). Various infection techniques have been developed 

30 which utilize recombinant infectious virus particles for gene delivery. This represents a 
preferred approach to the present invention. The viral vectors which have been used in 
this way include virus vectors derived from simian virus 40 (SV40; Karlsson et al., 
Proc. Natl. Acad. Sci. USA 84 82:158, 1985), adenoviruses (Karlsson et al., EMBO 
J. 5:2377, 1986), adeno-associated virus (LaFace et aL, Virology 762:483, 1988) and 

35 retroviruses (Coffin, 1985, pl7-71 in Weiss et al (eds.), RNA Tumor Viruses, 2nd 
ed., Vol 2, Cold Spring Harbor Laboratory, New York). Thus, gene transfer and 
expression methods are numerous but essentially function to introduce and express 
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genetic material in mammalian cells- Several of the above techniques have been used to 
transduce hematopoietic or lymphoid cells, including calcium phosphate transfection 
(Bennan et aL, supra, 1984), protoplast fusion (Deans et aL, supra, 1984), 
electroporation (Cann et aL, Oncogene 3:123, 1988), and infection with recombinant 
5 adenovirus (Karlsson et aL, supra; Ruether et aL, Mol Cell Biol 6:123, 1986) adeno- 
associated virus (LaFace et aL, supra) and retrovirus vectors (Overell et aL, Oncogene 
4:1425, 1989). Primary T lymphocytes have been successfully transduced by 
electroporation (Cann et aL, supra, 1988) and by retroviral infection (Nishihara et aL, 
Cancer Res. 48:4730, 1988; Kasid et aL, supra, 1990). 

10 

Construction of Selectable Fusion Genes 

The selectable fusion genes of the present invention comprise a dominant 
positive selectable gene fused to a negative selectable gene. A selectable gene will 
generally comprise, for example, a gene encoding a protein capable of conferring an 

15 antibiotic resistance phenotype or supplying an autotrophic requirement (for dominant 
positive selection), or activating a toxic metabolite (for negative selection). A DNA 
sequence encoding a bifunctional fusion protein is constructed using recombinant DNA 
techniques to assemble separate DNA fragments encoding a dominant positive 
selectable gene and a negative selectable gene into an appropriate expression vector. 

20 The 3* end of one selectable gene is ligated to the 5* end of the other selectable gene, 
with the reading frames of the sequences in frame to permit translation of the mRNA 
sequences into a single biologically active bifunctional fusion protein. The selectable 
fusion gene is expressed under control of a single promoter. 

The dominant positive selectable gene is any gene which, upon being 

25 transduced into a host cell, expresses a dominant phenotype permitting positive 
selection of stable transductants. Selection of stable transductants can be carried out, 
for example, using the hygromycin-B phosphotransferase gene {hph) which confers the 
selectable phenotype of hygromycin resistance (Hm 1 ) (Santerre et aL, Gene 50:147, 
1984; Sugden et aL, MoL Cell Biol 5:410, 1985; obtainable from plasmid pHEBol, 

30 under ATCC Accession No. 39820). Hygromycin B is an aminoglycoside antibiotic 
that inhibits protein synthesis by disrupting translocation and promoting mistranslation. 
The hph gene confers Hm r to cells transduced with the hph gene by phosphorylating 
and detoxifying the antibiotic hygromycin B. Other acceptable dominant positive 
selectable genes include the following: the aminoglycoside phosphotransferase gene 

35 (neo or aph) from Tn5 which codes for resistance to the antibiotic G418 (Colbere- 
Garapin et aL, J. Mol Biol 150:1, 198; Southern and Berg, /. Mol Appl Genet. 
7:327, 1982); the xanthine-guanine phosphoribosyl transferase gene (gpt) from E. coli 
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encoding resistance to mycophenolic acid (Mulligan and Berg, Proc. NatL Acad. Sci 
USA 75:2072, 1981); the dihydrofolate reductase (DHFR) gene from murine cells or 
E. coli which is necessary for biosynthesis of purines and can be competitively 
inhibited by the drug methotrexate (MTX) to select for cells constitutively expressing 
5 increased levels of DHFR (Simonsen and Levinson, Proc. NatL Acad. ScL (USA.) 
80:2495, 1983; Simonsen et al., NucL Acids. Res. 26:2235, 1988); the S. 
typhimurium histidinol dehydrogenase (hisD) gene (Hartman, et al, Proc. Natl. Acad. 
ScL (USA) 55:8047, 1988); the E. coli tryptophan synthase p subunit (trpB) gene 
(Hartman et al., supra); the puromycin-N-acetyl transferase (pad) gene (Vara et aL, 

10 NucL Acids Res. 14:4117, 1986); the adenosine deaminase (ADA) gene (Daddona et 
al., /. Biol. Chem. 259:12101, 1984); the multi-drug resistance (MDR) gene (Kane et 
al., Gene 54:439, 1989); the mouse ornithine decarboxylase (OCD) gene (Gupba and 
Coffino, /. Biol. Chem. 260:2941, 1985); the E. coli aspartate transcarbamylase 
catalytic subunit (pyrB) gene (Ruiz and Wahl, Mot. Cell. BioL 6:3050, 1986); and the 

15 E. coli asnA gene, encoding asparagine sythetase (Cartier et al., Mol. Cell. Biol. 
7:1623, 1987). 

The negative selectable gene is any gene which, upon being transduced into a 
host cell, expresses a phenotype permitting negative selection (i.e., elimination) of 
stable transductants. In preferred embodiments, the negative selectable gene used in the 

20 fusion genes of the present invention is the Herpes simplex virus type I thymidine 
kinase (HSV-I TK> gene (Wigler et aL, Cell 11:223, 1977; McKnight et al., NucL 
Acids Res. 5:5931, 1980; Preston et al., /. Virol. 58:593, 1981; Wagner et al., Proc. 
NatL Acad. Sci. USA 78: 1441, 1981) which confers ganciclovir sensitivity (GCV S ) 
(St Clair et aL,Antimicrob. Agents Chemother. 57:844, 1987). The HSV-I TK gene 

25 is available from Bethesda Research Labs (Catalog No. BRL 5365 SA). Negative 
selection can also be performed, for example, using the cellular hypoxanthine 
phosphoribosyltransferase (HPRT) gene (Jolly et al., Proc. Natl. Acad. Sci. USA 
80:477, 1983; Fenwick, "The HGPRT System", pp. 333-373, M. Gottesman (ed.), 
Molecular Cell Genetics, (John Wiley and Sons, New York, 1985)) and the cellular 

30 adenine phosphoribosyltransferase (APRT) gene (Wigler et aL, Proc. NatL Acad. Sci. 
USA 76:1373, 1979; Taylor et al., "The APRT System", pp., 311-332, M. Gottesman 
(ed.), Molecular Cell Genetics, (John Wiley and Sons, New Yoik, 1985)); and die E. 
coli gpt gene (Besnard et al., Mol. Cell. BioL 7:4139, 1987). 

Due to the degeneracy of the genetic code, there can be considerable variation in 

35 nucleotide sequences encoding the same amino acid sequence; exemplary DNA 
embodiments are those corresponding to the nucleotide sequences shown in Sequence 
Listing No. 1 . Such variants will have modified DNA or amino acid sequences, having 



WO 92/08796 



PCT/US91/08442 



11 

one or more substitutions, deletions, or additions, the net effect of which is to retain 
biological activity, and may be substituted for the specific sequences disclosed herein. 
The sequences of selectable fusion genes comprising hph and TK are equivalent if they 
contain all or part of the sequences of hph and HSV-I TK and are capable of 
5 hybridizing to the nucleotide sequence of Sequence Listing No. 1 under moderately 
stringent conditions (50°C, 2 X SSC) and express a biologically active fusion protein. 
A "biologically active" fusion protein will share sufficient amino acid sequence 
similarity with the specific embodiments of the present invention disclosed herein to be 
capable of conferring the selectable phenotypes of the component selectable genes. 

10 In a preferred embodiment, sequences from the bacterial hygromycin 

phosphotransferase {hph) gene are fused with sequences from the HSV-I TK gene. 
The resulting selectable fusion gene (referred to as the HyTK selectable fusion gene) 
encodes a bifunctional fusion protein that confers Hm r and GCV S , and provides a 
means by which dominant positive and negative selectable phenotypes may be 

15 expressed and regulated as a single genetic entity. The HyTK selectable fusion gene is 
therefore a useful addition to the existing panel of selectable genes available for use in 
animal cells, because it allows both dominant positive and negative selection in wild- 
type cells. 

20 Recombinant Expression Vectors 

The selectable fusion genes of the present invention are utilized to identify, 
isolate or eliminate host cells into which the selectable fusion genes are introduced. The 
selectable fusion genes are introduced into the host cell by transducing into the host cell 
a recombinant expression vector which contains the selectable fusion gene. Such host 

25 cells include cell types from higher eukaryotic origin, such as mammalian or insect 
cells, or cell types from lower prokaryotic origin, such as bacterial cells, for example, 
is. coli. 

As indicated above, such selectable fusion genes are preferably introduced into 
a particular cell as a component of a recombinant expression vector which is capable of 

30 expressing the selectable fusion gene within the cell and conferring a selectable 
phenotype. Such recombinant expression vectors generally include synthetic or natural 
nucleotide sequences comprising the selectable fusion gene operably linked to suitable 
transcriptional or translational control sequences, for example, an origin of replication, 
optional operator sequences to control transcription, a suitable promoter and enhancer 

35 linked to the gene to be expressed, and other 5' or 3' flanking nontranscribed 
sequences, and 5 1 or 3' nontranslated sequences, such as necessary ribosome binding 
sites, a polyadenylation site, splice donor and acceptor sites, and transcriptional 
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termination sequences. Such regulatory sequences can be derived from mammalian, 
viral, microbial or insect genes. Nucleotide sequences are operably linked when they 
are functionally related to each other. For example, a promoter is operably linked to a 
selectable fusion gene if it controls the transcription of the selectable fusion gene; or a 

5 ribosome binding site is operably linked to a selectable fusion gene if it is positioned so 
as to permit translation of the selectable fusion gene into a single Afunctional fusion 
protein. Generally, operably linked means contiguous. 

Specific recombinant expression vectors for use with mammalian, bacterial, and 
yeast cellular hosts are described by Pouwels et al. (Cloning Vectors: A Laboratory 

10 Manual, Elsevier, New York, 1985) and are well-known in the art A detailed 
description of recombinant expression vectors for use in animal cells can be found in 
Rigby, /. Gen. Virol 64:255, 1983; Elder et al., Ann. Rev. Genet. 75:295, 1981; and 
Subramani et al., Anal. Biochem. 755:1, 1983. Appropriate recombinant expression 
vectors may also include viral vectors, in particular retroviruses (discussed in detail 

15 below). 

The selectable fusion genes of the present invention are preferably placed under 
the transcriptional control of a strong enhancer and promoter expression cassette. 
Examples of such expression cassettes include the human cytomegalovirus immediate- 
early (HCMV-IE) promoter (Boshart et al., Cell 47:521, 1985), the p-actin promoter 

20 (Gunning et al., Proc. Natl. Acad. Sci. (USA) 54:5831, 1987), the histone H4 
promoter (Guild et al., /. Virol. 62:3795, 1988), the mouse metallothionein promoter 
(Mclvor et al., Mol. Cell. Biol. 7:838, 1987), the rat growth hormone promoter (Miller 
et aL, Mol. Cell. Biol. 5:431, 1985), the human adenosine deaminase promoter 
(Hantzapoulos et aL, Proc. Natl. Acad. Sci. USA 56:3519, 1989) the HSV ^promoter 

25 (Tabin et al., Mol. Cell. Biol. 2:426, 1982), the a-1 antitrypsin enhancer (Peng et aL, 
Proc. Natl. Acad. Sci. USA 85:8146, 1988) and the immunoglobulin 
enhancer/promoter (Blankenstein et al., Nucleic Acid Res. 76:10939, 1988), the SV40 
early or late promoters, the Adenovirus 2 major late promoter, or other viral promoters 
derived from polyoma virus, bovine papilloma virus, or other retroviruses or 

30 adenoviruses. The promoter and enhancer elements of immunoglobulin (Ig) genes 
confer marked specificity to B lymphocytes (Banerji et aL, Cell 55:729, 1983; Gillies et 
al., Cell 55:717, 1983; Mason et al., Cell 41:479, 1985), while the elements controlling 
transcription of the p-globin gene function only in aythroid cells (van Assendelft et al., 
Cell 56:969, 1989). Using well-known restriction and ligation techniques, appropriate 

35 transcriptional control sequences can be excised from various DNA sources and 
integrated in operative relationship with the intact selectable fusion genes to be 
expressed in accordance with the present invention. Thus, many transcriptional control 
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sequences may be used successfully in retroviral vectors to direct the expression of 
inserted genes in infected cells. 

Retroviruses 

5 Retroviruses can be used for highly efficient transduction of the selectable 

fusion genes of the present invention into eukaryotic cells and are preferred for the 
delivery of a selectable fusion gene into primary cells. Moreover, retroviral integration 
takes place in a controlled fashion and results in the stable integration of one or a few 
copies of the new genetic information per cell. 

10 Retroviruses are a class of viruses whose genome is in the form of RNA. The 

genomic RNA of a retrovirus contains transacting gene sequences coding for three 
viral proteins: a structural protein gag which associates with the RNA in the core of the 
virus particle; the reverse transcriptase pol which makes the DNA complement; and a 
envelope glycoprotein env which resides in the lipoprotein envelope of the particles and 

15 binds the virus to the surface of host cells on infection. Replication of the retrovirus is 
regulated by cw-acting elements, such as the promoter for transcription of the proviral 
DNA and other nucleotide sequences necessary for viral replication. The cw-acting 
elements are present in or adjacent to two identical untranslated long terminal repeats 
(LTRs) of about 600 base pairs present at the 5* and 3' ends of the retroviral genome. 

20 Retroviruses replicate by copying their RNA genome by reverse transcription into a 
double-stranded DNA intermediate, using a virus-encoded, RNA-directed DNA 
polymerase, or reverse transcriptase. The DNA intermediate is integrated into 
chromosomal DNA of an avian or mammalian host cell. The integrated retroviral DNA 
is called a provirus. The provirus serves as template for the synthesis of RNA chains 

25 for the formation of infectious virus particles. Forward transcription of the provirus 
and assembly into infectious virus particles occurs in the presence of an appropriate 
helper virus having endogenous *ra/ts-acting genes required for viral replication. 

Retroviruses are used as vectors by replacing one or more of the endogenous 
trans-acting genes of a proviral form of the retrovirus with a recombinant therapeutic 

30 gene or, in the case of the present invention, a selectable fusion gene, and then 
transducing the recombinant provirus into a cell. The trans-acting genes include the 
gag 9 pol and env genes which encode, respectively, proteins of the viral core, the 
enzyme reverse transcriptase and constituents of the envelope protein, all of which are 
necessary for production of intact virions. Recombinant retroviruses deficient in the 

35 amy-acting gag, pol or env genes cannot synthesize essential proteins for replication 
and are accordingly replication-defective. Such replication-defective recombinant 
retroviruses are propagated using packaging cell lines. These packaging cell lines 
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contain integrated retroviral genomes which provide all tra/is-acting gene sequences 
necessary for production of intact virions. Proviral DNA sequences which are 
transduced into such packing cells lines are transcribed into RNA and encapsidated into 
infectious virions containing the selectable fusion gene (and/or therapeutic gene), but, 
5 lacking the rra/tf-acting gene products gag, pol and env, cannot synthesize the 
necessary gag, pol and env proteins for encapsidating the RNA into particles for 
infecting other cells. The resulting infectious retrovirus vectors can therefore infect 
other cells and integrate a selectable fusion gene into the cellular DNA of a host cell, but 
cannot replicate. Mann et al. {Cell 35:153, 1983), for example, describe the 

10 development of various packaging cell lines (e.g., \p2) which can be used to produce 
helper virus-free stocks of recombinant retrovirus. Encapsidation in a cell line 
harboring rr<ms-acting elements encoding an ecotropic viral envelope (e.g., \jr2) 
provides ecotropic (limited host range) progeny virus. Alternatively, assembly in a cell 
line containing anisotropic packaging genes (e.g., PA317, ATCC CRL 9078; Miller 

15 and Buttimore, Mol Cell Biol 6:2895, 1986) provides amphotropic (broad host 
range) progeny virus. 

Numerous provirus constructs have been used successfully to express foreign 
genes (see, e.g., Coffin, in Weiss et al. (eds.), RNA Tumor Viruses, 2nd Ed., VoL 2, 
(Cold Spring Harbor Laboratory, New York, 1985, pp. 17-71). Most proviral 

20 elements are derived from murine retroviruses. Retroviruses adaptable for use in 
accordance with the present invention can, however, be derived from any avian or 
m a mm a lian cell source. Suitable retroviruses must be capable of infecting cells which 
are to be the recipients of the new genetic material to be transduced using the retroviral 
vector. Examples of suitable retroviruses include avian retroviruses, such as avian 

25 erythroblastosis virus (AEV), avian leukosis virus (ALV), avian myeloblastosis virus 
(AMV), avian sarcoma virus (AS V), Fujinami sarcoma vims (FuS V), spleen necrosis 
virus (SNV), and Rous sarcoma virus (RSV); bovine leukemia virus (BLV); feline 
retroviruses, such as feline leukemia virus (FeLV) or feline sarcoma virus (FeSV); 
murine retroviruses, such as murine leukemia virus (MuLV); mouse mammary tumor 

30 virus (MMTV), and murine sarcoma virus (MS V); and primate retroviruses, such as 
human T-cell lymphotropic viruses 1 and 2 (HTLV-1, and -2), and simian sarcoma 
virus (SSV). Many other suitable retroviruses are known to those skilled in the art A 
taxonomy of retroviruses is provided by Teich, in Weiss et al. (eds.), RNA Tumor 
Viruses, 2d ed„ Vol. 2 (Cold Spring Harbor Laboratory, New York, 1985, pp. 1- 

35 160). Preferred retroviruses for use in connection with the present invention are the 
murine retroviruses known as Moloney murine leukemia virus (MoMLV), Moloney 
murine sarcoma virus (MoMSV), Harvey murine sarcoma virus (HaMSV) and Kirsten 
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murine sarcoma virus (KiSV). the sequences required to construct a retroviral vector 
from the MoMSV genome can be obtained in conjunction with a pBR322 plasmid 
sequence such as pMV (ATCC 37190), while a cell line producer of KiSV in K-BALB 
cells has been deposited as ATCC CCL 163.3. A deposit of pRSVneo, derived from 
5 pBR322 including the RSV LTR and an intact neomycin drug resistance marker is 
available from ATCC under Accession No. 37198. Plasmid pPBlOl comprising the 
SNV genome is available as ATCC 45012. The viral genomes of the above 
retroviruses are used to construct replication-defective retrovirus vectors which are 
capable of integrating their viral genomes into the chromosomal DNA of an infected 

10 host cell but which, once integrated, are incapable of replication to provide infectious 
virus, unless the cell in which it is introduced contains other proviral elements encoding 
functional active rra/tf-acting viral proteins. 

The selectable fusion genes of the present invention which are transduced by 
retroviruses are expressed by placing the selectable fusion gene under the 

15 transcriptional control of the enhancer and promoter incorporated in the retroviral LTR, 
or by placing them under the control of a heterologous transcriptional control sequences 
inserted between the LTRs. Use of both heterologous transcriptional control sequences 
and the LTR transcriptional control sequences enables coexpression of a therapeutic 
gene and a selectable fusion gene in the vector, thus allowing selection of cells 

20 expressing specific vector sequences encoding the desired therapeutic gene product 
Obtaining high-level expression may require placing the therapeutic gene and/or 
selectable fusion gene within the retrovirus under the transcriptional control of a strong 
heterologous enhancer and promoter expression cassette. Many different heterologous 
enhancers and promoters have been used to express genes in retroviral vectors. Such 

25 enhancers or promoters can be derived from viral or cellular sources, including 
mammali an genomes, and are preferably constitutive in nature. Such heterologous 
transcriptional control sequences are discussed above with reference to recombinant 
expression vectors. To be expressed in die transduced cell, DNA sequences introduced 
by any of the above gene transfer methods are usually expressed under the control of an 

30 RNA polymerase n promoter. 

Particularly preferred recombinant expression vectors for use in retroviruses 
include pLXSN, pLNCX and pLNL6, and derivatives thereof, which are described by 
Miller and Rosman, Biotechniques 7:980, 1989. These vectors are capable of 
expressing heterologous DNA under the transcriptional control of the retroviral LTR or 

35 the CMV promoter, and the neo gene under the control of the SV40 early region 
promoter or the retroviral LTR. For use in the present invention, the neo gene is 
replaced with the bifunctional selectable fusion genes disclosed herein, such as the 
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HyTK selectable fusion gene. Construction of useful replication-defective retroviruses 
is a matter of routine skill. The resulting recombinant retroviruses are capable of 
integration into the chromosomal DNA of an infected host cell, but once integrated, arc 
incapable of replication to provide infectious virus, unless the cell in which it is 
5 introduced contains another proviral insert encoding functionally active transacting 
viral proteins. 

Uses of Bifunctional Selectable Fusion Genes 

The selectable fusion genes of the present invention are particularly preferred 

10 for use in gene therapy as a means for identifying, isolating or eliminating cells, such as 
somatic cells, into which the selectable fusion genes are introduced. In gene therapy, 
somatic cells are removed from a patient transduced with a recombinant expression 
vector containing a therapeutic gene and the selectable fusion gene of the present 
invention, and then reintroduced back into the patient Somatic cells which can be used 

15 as vehicles for gene therapy include hematopoietic (bone marrow-derived) cells, 
keratinocytes, hepatocytes, endothelial cells and fibroblasts (Friedman, Science 
244:1275, 1989); Alternatively, gene therapy can be accomplished through the use of 
injectable vectors which transduce somatic cells in vivo. Hie feasibility of gene transfer 
in humans has been demonstrated (Kasid et aL, Proc. Natl. Acad. Sci. USA S7:473, 

20 1990; Rosenberg et aL, N. Engl J. Med. 523:570, 1990). 

The selectable fusion genes of the present invention are particularly useful for 
el i m i nat ing genetically modified cells in vivo. In vivo elimination of cells expressing a 
negative selectable phenotype is particularly useful in gene therapy as a means for 
ablating a cell graft, thereby providing a means for reversing the gene therapy 

25 procedure. For example, it has been shown that administration of the anti-herpes virus 
drug ganciclovir to transgenic animals expressing the HSV-I TK gene from an 
immunoglobulin promoter results in the selective ablation of cells expressing the HSV-I 
TK gene (Heyman et aL, Proc. Natl. Acad. Sci (USA) S&2698, 1989). Using the 
same transgenic mice, GCV has also been shown to induce full regression of Abelson 

30 leukemia virus-induced lymphomas ((Moolten et aL, Human Gene Therapy i:125, 
1990). In a third study, in which a murine sarcoma (K3T3) was infected with a 
retrovirus expressing HSV-I TK and transplanted into syngeneic mice, the tumors 
induced by the sarcoma cells were completely eradicated following treatment with GCV 
(Moolten and Wells, /. Natl Cancer Inst. 82:297, 1990). 

35 The bifunctional selectable fusion genes of the present invention can also he 

used to facilitate gene modification by homologous recombination. Reid et aL, Proc. 
Natl. Acad. Sci. USA 87:4299, 1990 has recently described a two-step procedure for 
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gene modification by homologous recombination in ES cells ("in-out" homologous 
recombination) using the HPRT gene. Briefly, this procedure involves two steps: an 
"in" step in which the HPRT gene is embedded in target gene sequences, transfected 
into HPRT- host cells and homologous recombinants having incorporated the HPRT 
5 gene into the target locus are identified by their growth in HAT medium and genomic 
analysis using PCR. In a second "out" step, a construct containing the desired 
replacement sequences embedded in the target gene sequences (but without the HPRT 
gene) is transfected into the cells and homologous recombinants having the replacement 
sequences (but not the HPRT gene) are isolated by negative selection against HPRT*" 

10 cells. Although this procedure allows the introduction of subtle mutations into a target 
gene without introducing selectable gene sequences into the target gene, it requires 
positive selection of transformants in a HPRT- cell line, since the HPRT gene is 
recessive for positive selection. Also, due to the inefficient expression of the HPRT 
gene in ES cells, it is necessary to use a large 9-kbp HPRT mini-gene which 

15 complicates the construction and propagation of homologous recombination vectors. 
The selectable fusion genes of the present invention provide an improved means 
whereby "in-out" homologous recombination may be performed. Because the 
selectable fusion genes of the present invention are dominant for positive selection, any 
wild-type cell may be used (i.e., one is not limited to use of cells deficient in the 

20 selectable phenotype). Moreover, the size of the vector containing the selectable fusion 
gene is reduced significantly relative to the large HPRT mini-gene. 

By way of illustration, the HyTK selectable fusion gene is used as follows: In 
the first "in" step, the HyTK selectable fusion gene is embedded in target gene 
sequences, transfected into a host cell, and homologous recombinants having 

25 incorporated the HyTK selectable fusion gene into the target locus are identified by their 
growth in medium containing Hm followed by genome analysis using PCR. The 
HyTK + cells are then used in the second "out" step, in which a construct containing die 
desired replacement sequences embedded in the target gene sequences (but without the 
HyTK selectable fusion gene) is transfected into the cells. Homologous recombinants 

30 are isolated by selective elimination of HyTK+ cells using ganciclovir followed by 
genome analysis using PCR. 
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EXAMPLES 
Example 1 

5 

Construction and Characterization of 
Plasmid Vectors Containing HvTK Selectable Fusion Gene 

A. Construction of the Bifunctional HvTK Selectable Fusion Gene . The hph 
10 and HSV-I TK genes were first placed under the regulatory control of the HCMV 

promoter in tgCMV/hygro and tgCMV/TK, respectively. Plasmid tgCMV/hygro 
(Figure 1) consists of the following elements: the Ball-SstH fragment containing the 
HCMV IE94 promoter (Boshart et aL, Cell 47:521, 1985); an oligonucleotide 
containing a sequence conforming to a consensus translation initiation sequence for 

15 mammalian cells (GCCGCCACC ATG) (Kozak, Nucl. Acids Res. 75:8125, 1987); 
nucleotides 234-1256 from the hph gene (Kaster et aL, Nucl Acids Res. 11 :6895, 
1983), encoding hygromycin phosphotransferase; the BclI-BamHI fragment from the 
SV40 genome (Tooze, L, ed., Molecular Biology of Tumor Viruses, 2nd Ed. DNA 
Tumor Viruses. Cold Spring Harbor Laboratory, New York, 1981), containing the 

20 SV40 early region polyadenylation sequence; the NruI-AlwNI fragment from pML2d 
(Lusky and Botehan, Nature 293:19, 1981), containing the bacterial replication origin; 
and the AlwNI-Aatll fragment from pGEMl (Prornega Corporation), containing the B- 
lactamase gene. 

Plasmid tgCMV/TK (Figure 1) is similar to tgCMV/hygro, but contains 
25 nucleotides 5 19-1646 from the HSV-I TK gene (Wagner et aL, Proc. Natl. Acad. Sci. 
USA 78:1441, 1981) in place of the hph gene. 

Plasmid tgCMV/HyTK (Figure 1), containing the selectable fusion gene 
comprising the hph gene and the HSV-I TK gene, was constructed by inserting the 
1644-bp Spel-Scal fragment from tgCMV/hygro between the Spel and Mlul sites of 
30 tgCMV/TK. Before ligation, the Mlul site in the HSV-I TK gene was treated with T4 
DNA polymerase to allow blunt end ligation with the Seal site, thus preserving the 
open reading frame. Translation of this fused gene (referred to as the HyTK selectable 
fusion gene) is expected to generate a single bifunctional fusion protein, consisting of 
amino acids 1-324 from the hph protein and amino acids 10-376 from the HSV-I TK 
35 protein. The C-terminal 17 amino adds of die hph protein, and the N-terminal 9 amino 
acids of the TK protein, are deleted in the bifunctional fusion protein. 

B. Dominant Positive Selection of Cells Containing the HvTk Selectable 
Fusion Gene . To demonstrate that the HyTK selectable fusion gene encodes both hph 
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and TK enzymatic activities, the frequencies with which tgCMV/HyTK conferred 
hygromycin resistance (Hm r ) (in NIH/3T3 cells and Rat-2 cells), and the ability to 
grow in medium containing hypoxan thine, aminopterin, and thymidine (HAT 1 ) (in Rat- 
2 cells), were compared with those of the parental plasmids, tgCMV/hygro and 

5 tgCMV/TK, respectively. 

NIH/3T3 cells were grown in Dulbecco's Modified Eagle Medium (DMEM; 
Gibco Laboratories) supplemented with 10% bovine calf serum (Hyclone), 2 mM L- 
glutamine, 50 U/ml peniciltin, and 50 |ig/ml streptomycin at 37 °C in a humidified 
atmosphere supplemented with 10% C0 2 . TK* Rat-2 cells (Topp, Virology 223:408, 

10 1981) were grown in DMEM supplemented with 10% fetal bovine serum (Hyclone), 2 
mM L-glutamine, 50 U/ml penicillin, and 50 ^g/ml streptomycin at 37°C in a 
humidified atmosphere supplemented with 10% C0 2 . NIH/3T3 and Rat-2 cells were 
transfected with the DNA vectors described above by electroporation, as follows. 
Exponentially growing NIH/3T3 and Rat-2 cells were harvested by trypsinization, 

15 washed free of serum, and resuspended in DMEM at a concentration of 10 7 cells/ml. 
Supercoiled plasmid DNA (5 Ug) was added to 800 ul of cell suspension (8X10 6 cells), 
and the mixture subjected to electroporation using the Biorad Gene Pulser and 
Capacitance Extender (200-300 V, 960 uF, 0.4 cm electrode gap, at ambient 
temperature). Following transfection, the cells were returned to 9-cro dishes and 

20 grown in non-selective medium After 24 hours, the cells were trypsinized, seeded at 
6x10 s per 9-cm dish, and allowed to attach overnight The non-selective medium was 
replaced with selective medium (containing 500 jxg/ml hygromycin B for NIH3T3 
cells, and 300 ng/ml hygromycin B or HAT for Rat-2 cells), and selection was 
continued for approximately 10-12 days until colonies were evident The plates were 

25 stained with methylene blue and counted. The results are shown in Table 1 below. 
The number of colonies reported is the average number of colonies per 9-cm dish. 

TABLE 1 



30 



Positive Selection Using HyTK Fusion Gene 


Plasmid 


NIH3T3 Cells 


Rat-2 rfells 


No. Hm 1 
Colonies 


No.Hm r 
Colonies 


No. HATT 

Colonies 


tgCMV/hygro 


45 


368 


n.t. 


tgCMV/TK 


n.t. 


n.t. 


356 


tgCMV/HyTK 


100 


428 


124 



n.t= not tested. 
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In both cell lines, tgCMV/HyTK gave rise to Hm r colonies at a slightly higher 
frequency than tgCMV/hygro. However, in Rat-2 cells, tgCMV/HyTK was slightly 
less efficient than tgCMV/TK in generating HAT 1, colonies. This demonstrates that the 
5 HyTK selectable fusion gene encodes both hp h and TK enzymatic activities, although 
with altered efficiencies. 

C Negative Selection of Cells Containing the HvTK Selec table Fusion Gene . 
To investigate the utility of the HyTK selectable fusion gene for negative selection, the 
colonies resulting from each transfection (Table 1) were pooled and expanded into cell 
10 lines for further analysis. The Hnf NIH/3T3 cell pools and the Hitf and HAT 1 * Rat-2 
cell pools were tested for GCV 8 in a short term cell proliferation assay as follows. 

The transfected NM/3T3 and Ratr2 cells (3 x 10 4 of each) were seeded into 9- 
cm tissue culture dishes in complete growth medium, and allowed to attach for 4 hours. 
The medium was then supplemented with various concentrations of GCV (Syntex, Palo 
15 Alto, CA), and the cells incubated for an additional 60 hours. At this time, the medium 
was removed, the attached cells were harvested by trypsinization and stained with 
trypan blue, and viable cells were counted. Cell growth was expressed as a fraction of 
the cell growth observed in the absence of GCV. The results shown are the average of 
triplicate assays. 

20 The results shown in Figure 3 demonstrate that the HyTK selectable fusion 

gene confers GCV 8 in NIH/3T3 cells. The degree of inhibition of cell growth was 
proportional to the concentration of GCV used, and approached 100% at a 
concentration of 1 |iM. In contrast, NIH/3T3 cells transfected with tgCMV/hygro were 
not adversely affected by GCV over die range of concentrations tested (0.03 - 1.0 

25 MM). 

The results shown in Figure 4 indicate that the HyTK selectable fusion gene is 
more effective than the HSV-I TK gene for negative selection in Rat-2 cells. Growth of 
Rat-2 cells transfected with tgCMV/HyTK was almost completely inhibited even at the 
lowest concentration of GCV used (0.03 pM), whether the cells were initially selected 

30 for Hm r or HAT. Growth of Rat-2 cells transfected with tgCMV/hygro was not 
inhibited by GCV over the range of concentrations tested (0.03 \iM - 1.0 |xM). The 
growth of Rat-2 cells transfected with tgCMV/TK was inhibited by GCV, but the 
concentrations required for growth inhibition were much higher than those required to 
inhibit the growth of Rat-2 cells transfected with tgCMV/HyTK. The Rat-2 cells 

35 transfected with tgCMV/TK were less sensitive to GCV than the Rat-2 cells transfected 
with tgCMV/HyTK. This appears to conflict with the result obtained when the two 
genes were used for positive selection in Rat-2 cells (Table 1), which indicated that the 
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HyTK selectable fusion gene was less effective than the HSV-I TK gene in conferring 
HAT 1 "* A further observation concerning the relative sensitivities of these cell lines to 
GCV was that the NIH/3T3 cells transfected with tgCMV/HyTK were less sensitive to 
GCV than the Rat-2 cells transfected with tgCMV/HyTK. 
5 D. Northern Analysis of Transfected Cell Lines. To investigate the basis for 

the the differential sensitivities of the Hm r and HAT NIH/3T3 and Rat-2 cell pools to 
GCV (Figures 3 and 4), and the altered efficiency with which the HyTK selectable 
fusion gene gave rise to Hm r and HAT colonies (Table 1), Northern blots of mRNA 
from each cell pool were probed with sequences from the hph and HSV-I TK genes, as 
10 follows. 

Polyadenylated mRNA was prepared according to standard procedures 
(Ausabel et al., eds., Current protocols in Molecular Biology. Wiley, New York., 
1987). RNA samples (10 |xg) were electrophoresed through Lift agarose gels 
containing formaldehyde as described (Ausabel et al., supra). Following 

15 electrophoresis, the gels were inverted and blotted by capillary transfer in 20 x SSC 
onto Duralon UV nylon membranes (Stratagene). After fixing the mRNA to the 
membrane by UV-irradiation (0.12 J/cm 2 ), the membranes were incubated in Stark's 
buffer (50% formanride, 5 x SSC, 50 mM potassium phosphate (pH 6.5), lft SDS, 
0.1% Ficoll, 0.1% PVP, 0.1% BSA, 300 *ig/ml sheared and denatured salmon sperm 

20 DNA, 0.05% Sarkosyl) at 50 °C for several hours. A uniformly labelled single 
stranded antisense RNA probe specific for hph was prepared (Ausabel et al., supra), 1 
x 10 6 cpm/ml were added to the hybridization mixture, and the incubation was 
continued at 63 °C for 15 h. The membrane was then washed in 0.1 x SSC, 0.1% SDS 
at 63 °C, and exposed to autoradiographic film (Kodak XAR-5). For detection of 

25 HSV-I TK and fl-actin sequences, gel-purified restriction fragments from the HSV-I 
TK and B-Actin genes were radiolabelled by random priming (Ausabel et al., supra). 
Membranes were pre-hybridized in Starks buffer at 42°C for several hours, after which 
1 x 10 6 cpm/ml of probe was added to the hybridization mixture and incubation 
continued at 42°C for 15 hours. The membranes were then washed in 6 x SSC, 1% 

30 SDS at 63°C, and exposed to autoradiographic film (Kodak XAR-5). 

In both Rat-2 and NDH/3T3 cells, the steady state level of mRNA detected with 
the hph probe was higher in the cells transfected with tgCMV/hygro than the cells 
transfected with tgCMV/HyTK and selected for Hm r (Figure 5, gel A, lanes 5 and 6). 
This may indicate that a higher level of expression of die hph gene is required to confer 

35 resistance to equivalent levels of hygromycin B (300 Jig/ml in Rat-2, and 500 jig/ml in 
NIH/3T3), due to the fact that the Afunctional fusion protein is more effective than die 
hph protein at inactivating hygromycin B, or is more stable than the hph protein. This 
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conclusion is supported by the results in Table 1, which show that tgCMV/HyTK gave 
rise to a slightly greater number of Hitf colonies in both cell lines than did 
tgCMV/hygro. 

The RNA Northern analysis also indicated that die Rat-2 cells transfected with 
5 tgCMV/TK expressed a steady state level of mRNA similar to the Rat-2 cells 
transfected with tgCMV/HyTK and selected for HAT* (Figure 5, gel B, lanes 2 and 4). 
However, tgCMV/TK gave rise to a greater number of HAT* cells than did 
tgCMV/HyTK (Table 1). This suggests that the HyTK selectable fusion protein is less 
effective than the HSV-I TK protein at phosphorylating thymidine, or is less stable than 

10 the HSV-I TK protein. 

Finally, the Rat-2 cells transfected with tgCMV/HyTK expressed steady state 
levels of mRNA several fold higher than (when selected for Hirf: Figure 5, gel B, lane 
3), or similar to (when selected for HAT 1 ; Figure 5, gel B, lane 4), the Rat-2 cells 
transfected with tgCMV/TK (Figure 5, gel B, lane 2). However, both the Rat-2 cell 

15 pools transfected with tgCMV/HyTK were over 30-fold more sensitive to GCV than 
the Rat-2 cells transfected with tgCMV/TK (Figure 4). This suggests that the 
bifiinctional fusion protein is markedly more effective than the HSV-I TK protein at 
phosphorylating ganciclovir, or is markedly more stable than the HSV-I TK protein. 
The increased ability of the Afunctional fusion protein to confer GCV S , and 

20 concomitant decreased ability to confer HAF, suggests that the substrate affinity of the 
bifiinctional fusion protein is altered relative to that of the HSV-I TK protein, rather 
than the stability. 

Example 2 

25 

Construction and Characterization of 
Retroviral Vectors C ontaining HvTK Selectable Fusion Gene 

A. Construction of Retroviral Vectors. Two retroviral expression vectors 
30 containing the HyTK selectable fusion gene were constructed. In the first, 
tgLS(+)HyTK, the HyTK selectable fusion gene was placed under the regulatory 
control of the promoter present in the retroviral LTR. In the second, tgLS(- 
)CMV/HyTK, the HyTK selectable fusion gene was placed under the regulatory control 
of theHCMV promoter. 
35 The retroviral expression vector tgLS(+)HyTK (die proviral structure of which 

is shown in Figure 2) consists of the following elements: the 5' LTR and sequences 
through the Psfl site at nucleotide 984 of MoMS V (Van Beveren et aL, Cell 27:97, 
1981); sequences from the PstI site at nucleotide 563 to nucleotide 1040 of MoMLV 
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(Shinnick ct al., Nature 295:543, 1981), incorporating point mutations (ATG -> TAG) 
which eliminate the Pr65 gag translation initiation codon (Bender et al., /. Virol. 
67:1639, 1987); a fragment from tgCMV/HyTK, containing the HyTK selectable 
fusion gene; sequences from nucleotide 7764 and through the 3' LTR of MoMLV 
5 (Shinnick et al., Nature 295:543, 1981); the NruI-AlwNI fragment from pML2d 
(Lusky and Botchan, Nature 295:79, 1981), containing the bacterial replication origin; 
and the AlwNI-Aatll fragment from pGEMl (Promega Corporation), containing the B- 
lactamase gene. 

The retroviral expression vector tgLS(-)CMV/HyTK (the proviral structure of 
10 which is shown in Figure 2) is similar to tgLS(+)HyTK, but carries a point mutation 
(AGGT AGGC) which eliminates the MoMSV-derived splice donor sequence 
(transferred from the retroviral vector, AH [Overell et aL, MoL Cell. Biol. 5:1803, 
1988]), and contains the HCMV promoter upstream of the HyTK selectable fusion 
gene sequences. 

15 The retroviral expression vector tgLS(+)HyTK/stop (the proviral structure of 

which is shown in Figure 2) was derived from tgLS(+)HyTK by inserting the universal 
translation terminator oligonucleotide (Pharmacia), 5 , -GCTTAATTAATTAAGC-3\ at 
the Nael site located near the junction of the hph and HSV-I TK sequences of the 
HyTK selectable fusion gene. 

20 B. Generation of Stable Cell Lines Producing Retroviral Vectors . Stable ¥2 

packaging cell lines were generated which produce the above ecotropic retroviruses as 
follows. ¥2 cells (Mann et aL, Cell 55:153, 1983) were grown in Dulbecco's 
Modified Eagle Medium (DMEM; Gibco Laboratories) supplemented with 10% bovine 
calf serum (Hyclone), 2 mM L-glutamine, 50 U/ml penicillin, and 50 ^ig/ml 

25 streptomycin at 37 °C in a humidified atmosphere supplemented with 10% C0 2 . 

PA317 cells (Miller and Buttimore, MoL Cell Biol d:2895, 1986) were grown in 
DMEM supplemented with 10% fetal bovine serum (Hyclone), 2 mM L-glutamine, 50 
U/ml penicillin, and 50 |ig/ml streptomycin at 37 °C in a humidified atmosphere 
supplemented with 10% C0 2 . 

30 The retroviral expression vectors described above were first transfected into 

PA317 amphotropic packaging cells by electroporation. Amphotropic virions produced 
by the transiently transfected PA317 packaging cells were then used to infect the *¥2 
cells as follows. Exponentially growing PA317 cells were harvested by trypsinization, 
washed free of serum, and resuspended in DMEM at a concentration of 10 7 cells/ml. 

35 Supercoiled plasmid DNA (5 |ig) was added to 800 \d of cell suspension (8x10 s cells), 
and the mixture subjected to electroporation using the Biorad Gene Pulser and 
Capacitance Extender (200-300 V, 960 |iF, 0.4 cm electrode gap, at ambient 
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temperature). The transfected PA317 cells were then transferred to a 9-cm tissue 
culture dish containing 10 ml of complete growth medium supplemented with 10 mM 
sodium butyrate (Sigma Chemical Co.), and allowed to attach overnight After 15 
hours, the medium was removed and replaced with fresh medium. After a further 24 
5 hours, the medium containing transiently produced amphotropic retrovirus particles 
was harvested, centrifuged at 2000 rpm for 10 min, and used to infect the Y2 ecotropic 
packaging cells. Exponentially dividing ¥2 cells were plated at a density of 10 6 cells 
per 9-cm tissue culture dish, and allowed to attach overnight The following day, the 
medium was removed and replaced with serial dilutions of the virus-containing 

10 supernatant (6 ml/dish) in medium supplemented with 4 jig/ml polybrene (Sigma 
Chemical Co.). Infection of die *P2 cells by the viral particles was allowed to proceed 
overnight, and then the supernatant was replaced with complete growth medium. 
Infected cells were selected for drug resistance after a further 8-24 hours of growth by 
adding hygromycin B (Calbiochem) to a final concentration of 500 |lg/ml. Colonies of 

15 Hm r cells were isolated using cloning cylinders 12-14 days later, and individually 
expanded into bulk cultures for analysis. Southern analysis (data not shown) revealed 
that the proviral structures were intact in six out of six independent clones, indicating 
that the HyTK selectable fusion gene is compatible with the retroviral life cycle. 

C. Transduction of Hitf, HATT and GCVS bv tgLSf+WTK and tgLSf-Y 

20 CMV/HvTK Retroviral Expression Vectors . The infected ¥2 clones were titered on 
NIH/3T3 cells (selecting for Hitf), and on Rat-2 cells (selecting for Hm r , or for HATQ 
(Table 2), as follows. The *F2 clones producing the virus woe grown to confluence in 
9-cm tissue culture dishes, then fed with 15 ml of drug-free medium. After an 
overnight incubation, aliquots of supernatant were taken for assay. Exponentially 

25 dividing NIH/3T3 or Rat-2 cells were harvested by trypsinization and seeded at a 
density of 2.5 x 10 4 cells per 35 mm well in 6-well tissue culture trays. Hie following 
day, the medium was replaced with serial dilutions of virus-containing supernatant (1 
ml/well) in medium supplemented with 4 jig/ml polybrene (Sigma Chemical Co.). All 
supernatants were centrifuged at 2000 rpm for 10 min before use to remove viable 

30 cells. Infection was allowed to proceed overnight, and then the supernatant was 
replaced with complete growth medium. Infected cells were selected for drug 
resistance after a further 8-24 hours of growth by adding hygromycin B (Calbiochem) 
to a final concentration of 500 Jig/ml (NIH/3T3 cells) or 300 |ig/ml (Rat-2 cells), or by 
adding HAT supplement (Gibco) (Rat-2 cells). After a total of 12-14 days of growth, 

35 cells were fixed in situ with 100% methanol, and stained with methylene blue. 

As shown in Table 2, below, both retroviruses conferred Hnf (to NIH/3T3 and 
Rat-2 cells) and HAT* (to Rat-2 cells). All viruses were harvested from a clone of 
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infected ¥2 cells. 

TABLE 2 



Titers of ecotropic retroviruses produced by *P2 
packaging cells on NIH/3T3 cells and Rat-2 cells. 



10 



15 





NIH/3T3 
Hrrf 


Rat-2 

Hnf 


HAT 


Virys 


CFU/ml 


CFU/ml 


mi/mi 


tgLS(+)HyTK 


1.8 x 10 7 


1.6 x 10 7 


4X10 6 


clone 5.5 








tgLS (-)CMV/HyTK 


lxlO 6 


lxlO 6 


8 x10 s 


clone 6.2 









20 To demonstrate that the tgLS(+)HyTK and tgLS(-)CMV/HyTK retroviruses 

also conferred GCV S , NIH/3T3 cells infected with the two retroviruses, were selected 
for Hrrf (500 ng/ml) for 10 days, and then pooled, expanded, and tested for GCV S in 
die following long-term cell proliferation assay. 

Uninfected NIH/3T3 cells and the infected NIH/3T3 cell pools were plated at 

25 relatively low cell density (10 4 cells/9-cm dish) in complete growth medium and 
allowed to attach for 4 hours. Hie medium was then supplemented with hygromycin B 
(500 ng/ml), with or without 1 nM GCV, and the cells incubated for a period of 10 
days. The medium was then removed and the cells were fixed in situ with 100% 
methanol and stained with methylene blue. The growth of both cell pools, as measured 

30 by colony formation, was almost completely inhibited by GCV (Figure 6, plates e and 
g), indicating that both retroviruses conferred Hm r and GCV S . Uninfected NIH/3T3 
were resistant to this concentration of GCV, and grew to a confluent monolayer (Figure 
6, plate b), but were completely killed by 500 ng/ml Hm (Figure 6, plate c). Colonies 
of cells resistant to both Hm and GCV were obtained at a low frequency (lOMO 3 ) 

35 from the retrovirus-infected populations (Figure 6, plates e and g). The proviruses 
present in the cells that gave rise to these colonies had likely suffered point mutations, 
or very small deletions or rearrangements in the HSV-I TK moiety which eliminated the 
ability to phosphorylate GCV. Similar results were also obtained with Rat-2 cell lines 
infected with tgLS(+)HyTK or with tgLS(-)CMV/HyTK (data not shown). 
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Example 3 

Evidence for the Pnxiuction of a Bifunctional Selectable Fusion Protein 

5 

In HSV-I infected cells, the HS V-I TK gene normally utilizes three translation 
initiation sites, and encodes three nested polypeptides which all possess TK activity 
(Haarr et ah, /. Virol 56:512, 1985). Since the HyTK selectable fusion gene retains 
two of these initiation codons, it was conceivable that, as a result of translation 

10 initiation at one or both of these internal AUG codons, the HyTK selectable fusion gene 
might also encode nested polypeptides possessing TK activity. Hie bifunctional fusion 
protein, while retaining hph activity, might or might not possess TK activity. To rule 
out this possibility, an oligonucleotide sequence, 5'-GCTTAATTAATTAAGC-3\ 
bearing translation termination codons in all three reading frames was introduced into 

15 the HyTK selectable fusion gene in tgLS(+)HyTK, generating the construct designated 
tgLS(+)HyTK/stop (Figure IB). The oligonucleotide was inserted at a Nael site 
downstream of the ApA-derived sequences, but upstream of the two internal AUG 
codons in the HSV-I TK-derived sequences of the HyTK selectable fusion gene 
(Figure IB). The tgLS(+)HyTK and tgLS(+)HyTK/stpp retroviral expression vectors 

20 were transfected into ¥2 cells, and the transiently produced virus was used to infect 
Rat-2 cells, which were then selected for Hm r or HAT* (Table 3). The retroviral 
expression vector tgLS(+)HyTK transduced both Hm r and HAT 1 ", but retroviral 
expression vector tgLS(+)HyTK/stop was only able to transduce Hnf . Insertion of the 
translation termination codons completely abolished the ability of the retrovirus to 

25 transduce HAT r , indicating that the internal translation initiation codons are not utilized 
in the HyTK selectable fusion gene, and the HyTK selectable fusion gene does indeed 
encode a bifunctional fusion protein. Viruses were harvested from transiently 
transfected *¥2 cells. 

TABLE 3 



30 



Titers of ecotropic retroviruses produced transiently in *F2 
packaging cells on NIH/3T3 cells and Rat-2 cells. 



Rat-? 



35 En? Hrf HAT 

YiffiS CFU/ml CFU/ml CRT/ml 

tgLS(+)HyTK 4.5 xlO 4 9.5 xlO 3 1.1 x 10 4 

tgLS(-)CMV/HyTK 2.6 xlO 4 5.9 xlO 4 0 
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As described in the above examples, retroviral expression vectors containing the 
HyTK selectable fusion gene were constructed and used to demonstrate the efficacy of 
the HyTK selectable fusion gene for positive and negative selection in NIH/3T3 and 
Rat-2 cells. High titer virus stocks were generated, which conferred both Hm r and 
5 HAT 1 " on infected cells. Infected cells contained unreairanged proviruses and were 
killed (>99.9%) by GCV. The HyTK selectable fusion gene was slightly more 
effective than the hph gene at conferring Hm T in both NIH/3T3 and Rat-2 cells (Table 
1). Genetic evidence that the HyTK selectable fusion gene encodes a bifunctional 
fusion protein possessing hph and HSV-I TK enzymatic activities was obtained by 

10 inserting translation termination codons into the HyTK selectable fusion gene (in 
tgLS(+)HyTK/stop; Figure 2), downstream of the /ipA-derived sequences, but 
upstream of the HSV-I TK-derived sequences. As would be expected if the HyTK 
selectable fusion gene encoded a bifunctional fusion protein, insertion of the translation 
termination codons left the ability of the virus to confer Hitf intact, but abolished the 

15 ability of the retrovirus to transduce HAT? (Table 3). When compared with the HSV-I 
TK gene in Rat-2 cells, the HyTK selectable fusion gene was slightly less effective at 
conferring ability to grow in HAT medium (Table 1), but markedly more effective at 
conferring GCV S (Figure 4). These observations cannot be explained on the basis of 
the relative steady state levels of mRNA expression (Figure 5), nor on the basis of 

20 changes in the stability of the HyTK selectable fusion protein. The apparent 
contradiction might be explained by hypothesizing that the HSV-I TK-derived moiety 
of the HyTK selectable fusion protein possesses a substrate affinity different from that 
of the wild-type HSV-I TK protein (possibly due to conformational change), with a 
reduced ability to phosphorylate thymidine and an increased ability to phosphorylate 

25 GCV. Altered substrate affinities have been noted previously in a number of 
pathogenic drug-resistant strains of HSV-I, which encode mutant TK proteins that 
exhibit a reduced ability to phosphorylate thymidine analogs, yet retain the ability to 
phosphorylate thymidine (Larder et aL, /. Biol. Chem. 255:2027, 1983; Palu et al., 
Virus Res. i3:303, 1989; Larder and Daiby, Antiviral Res. 4:1,1984). The slightly 

30 increased efficiency with which the HyTK selectable fusion gene confers Hirf, relative 
to the hph gene (Table 1), may be due to an increase in protein stability, or an increased 
specific activity of the phosphotransferase. 

Moreover, a single approximately 76 kD protein was specifically 
immunoprecipitated by a rabbit polyclonal antiserum directed against HSV-I TK from 

35 extracts of cells expressing the HyTK selectable fusion gene. Thus the phenotype 
conferred by the HyTK selectable fusion gene was not due to internal translation 
initiation in the HSV-I TK derived moiety of the gene, and the HyTK selectable fusion 
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gene does indeed encode a Afunctional selectable fusion gene. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Lupton, Stephen D. 
(ii) TITLE OF INVENTION: Bifunctional Selectable Fusion Genes 
(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Iimminex Corporation 
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(F) ZIP: 98101 

(v) COMPUTER READABLE FORM: 
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(B) COMPUTER: Apple Macintosh 

(C) OPERATING SYSTEM: Maintosh 

(D) SOFTWARE: Microsoft Word 
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(A) APPLICATION NUMBER: 
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(ix) ATTORNEY /AGENT INFORMATION: 
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(x) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206)587-0430 
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(C) TELEX: 756822 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2076 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
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<B) LOCATION: 1..2076 
<D) OTHER INFORMATION: 
(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..2073 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG AAA AAG CCT GAA CTC ACC GCG ACG TCT GTC GAG AAG TTT CTG ATC 48 
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu lie 
1 5 10 15 

GAA AAG TTC GAC AGC GTC TCC GAC CTG ATG CAG CTC TCG GAG GGC GAA 96 
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gin Leu Ser Glu Gly Glu 
20 25 30 

GAA TCT CGT GCT TTC AGC TTC GAT GTA GGA GGG CGT GGA TAT GTC CTG 144 
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu 
35 40 ^ 45 

CGG GTA AAT AGC TGC GCC GAT GGT TTC TAC AAA GAT CGT TAT GTT TAT 192 
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr 
50 55 60 

CGG CAC TTT GCA TCG GCC GCG CTC CCG ATT CCG GAA GTG CTT GAC ATT 240 
Arg His Phe Ala Ser Ala Ala Leu Pro He Pro Glu Val Leu Asp He 
65 70 75 80 

GGG GAA TTC AGC GAG AGC CTG ACC TAT TGC ATC TCC CGC CGT GCA CAG 288 
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys He Ser Arg Arg Ala Gin 
85 90 95 

GGT GTC ACG TTG CAA GAC CTG CCT GAA ACC GAA CTG CCC GCT GTT CTG 336 
Gly Val Thr Leu Gin Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu 
100 105 HO 

CAG CCG GTC GCG GAG GCC ATG GAT GCG ATC GCT GCG GCC GAT CTT AGC 384 
Gin Pro Val Ala Glu Ala Met Asp Ala He Ala Ala Ala Asp Leu Ser 
115 120 125 

CAG ACG AGC GGG TTC GGC CCA TTC GGA CCG CAA GGA ATC GGT CAA TAC 432 
Gin Thr Ser Gly Phe Gly Pro Phe Gly Pro Gin Gly He Gly Gin Tyr 
130 135 140 

ACT ACA TGG CGT GAT TTC ATA TGC GCG ATT GCT GAT CCC CAT GTG TAT 480 
Thr Thr Trp Arg Asp Phe He Cys Ala He Ala Asp Pro His Val Tyr 
145 150 155 160 

CAC TGG CAA ACT GTG ATG GAC GAC ACC GTC AGT GCG TCC GTC GCG CAG 528 
His Trp Gin Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gin 
165 170 175 

GCT CTC GAT GAG CTG ATG CTT TGG GCC GAG GAC TGC CCC GAA GTC CGG 576 
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg 
180 185 190 

CAC CTC GTG CAC GCG GAT TTC GGC TCC AAC AAT GTC CTG ACG GAC AAT 624 
His Leu Val His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn 
195 200 205 
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GGC CGC ATA ACA GCG GTC ATT GAC TGG AGC GAG GCG ATG TTC GGG GAT 672 
Gly Arg He Thr Ala Val He Asp Trp Ser Glu'Ala Met Phe Gly Asp 
210 215 220 

TCC CAA TAC GAG GTC GCC AAC ATC TTC TTC TGG AGG CCG TGG TTG GCT 720 
Ser Gin Tyr Glu Val Ala Asn He Phe Phe Trp Arg Pro Trp Leu Ala 
225 230 235 240 

TGT ATG GAG CAG CAG ACG CGC TAC TTC GAG CGG AGG CAT CCG GAG CTT 768 
Cys Met Glu Gin Gin Thr Arg Tyr Phe Glu Arg Arg His Pro Glu Leu 
245 250 255 

GCA GGA TCG CCG CGG CTC CGG GCG TAT ATG CTC CGC ATT GGT CTT GAC 816 
Ala Gly Ser Pro Arg Leu Arg Ala Tyr Met Leu Arg He Gly Leu Asp 
260 265 270 

CAA CTC TAT CAG AGC TTG GTT GAC GGC AAT TTC GAT GAT GCA GCT TGG 864 
Gin Leu Tyr Gin Ser Leu Val Asp Gly Asn Phe Asp Asp Ala Ala Trp 
275 280 285 

GCG CAG GGT CGA TGC GAC GCA ATC GTC CGA TCC GGA GCC GGG ACT GTC 912 
Ala Gin Gly Arg Cys Asp Ala He Val Arg Ser Gly Ala Gly Thr Val 
290 295 300 

GGG CGT ACA CAA ATC GCC CGC AGA AGC GCG GCC GTC TGG ACC GAT GGC 
Gly Arg Thr Gin He Ala Arg Arg Ser Ala Ala Val Trp Thr Asp Gly 
305 " 310 315 320 

TGT GTA GAA GTC GCG TCT GCG TTC GAC CAG GCT GCG CGT TCT CGC GGC 
Cys Val Glu Val Ala Ser Ala Phe Asp Gin Ala Ala Arg Ser Arg Gly 
325 330 335 



960 



1008 



CAT AGC AAC CGA CGT ACG GCG TTG CGC CCT CGC CGG CAG CAA GAA GCC 1056 
His Ser Asn Arg Arg Thr Ala Leu Arg Pro Arg Arg Gin Gin Glu Ala 
340 345 350 

ACG GAA GTC CGC CCG GAG CAG AAA ATG CCC ACG CTA CTG CGG GTT TAT 1104 
Thr Glu Val Arg Pro Glu Gin Lys Met Pro Thr Leu Leu Arg Val Tyr 
355 - 360 365 

ATA GAC GGT CCC CAC GGG ATG GGG AAA ACC ACC ACC ACG CAA CTG CTG 1152 
He Asp Gly Pro His Gly Met Gly Lys Thr Thr Thr Thr Gin Leu Leu 
370 375 380 

GTG GCC CTG GGT TCG CGC GAC GAT ATC GTC TAC GTA CCC GAG CCG ATG 1200 
Val Ala Leu Gly Ser Arg Asp Asp He Val Tyr Val Pro Glu Pro Met 
385 ~ 390 395 400 

ACT TAC TGG CGG GTG CTG GGG GCT TCC GAG ACA ATC GCG AAC ATC TAC 1248 
Thr Tyr Trp Arg Val Leu Gly Ala Ser Glu Thr He Ala Asn He Tyr 
405 410 415 

ACC ACA CAA CAC CGC CTC GAC CAG GGT GAG ATA TCG GCC GGG GAC GCG 1296 
Thr Thr Gin His Arg Leu Asp Gin Gly Glu He Ser Ala Gly Asp Ala 
420 425 430 

GCG GTG GTA ATG ACA AGC GCC CAG ATA ACA ATG GGC ATG CCT TAT GCC 1344 
Ala Val Val Met Thr Ser Ala Gin He Thr Met Gly Met Pro Tyr Ala 
435 440 445 
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GTG ACC GAC GCC GTT CTG GCT CCT CAT ATC GGG 'GGG GAG GCT GGG AGC 1392 
Val Thr Asp Ala Val Leu Ala Pro His He Gly Gly Glu Ala Gly Ser 
450 455 460 

TCA CAT GCC CCG CCC CCG GCC CTC ACC CTC ATC TTC GAC CGC CAT CCC 1440 
Ser His Ala Pro Pro Pro Ala Leu Thr Leu He Phe Asp Arg His Pro 
465 470 475 " ~ 480 

ATC GCC GCC CTC CTG TGC TAC CCG GCC GCG CGG TAC CTT ATG GGC AGC 1488 
He Ala Ala Leu Leu Cys Tyr Pro Ala Ala Arg Tyr Leu Met Gly Ser 
485 490 ** 495 

ATG ACC CCC CAG GCC GTG CTG GCG TTC GTG GCC CTC ATC CCG CCG ACC 1536 
Met Thr Pro Gin Ala Val Leu Ala Phe Val Ala Leu He Pro Pro Thr 
500 505 510 

TTG CCC GGC ACC AAC ATC GTG CTT GGG GCC CTT CCG GAG GAC AGA CAC 1584 
Leu Pro Gly Thr Asn He Val Leu Gly Ala Leu Pro Glu Asp Arg His 
515 520 525 

ATC GAC CGC CTG GCC AAA CGC CAG CGC CCC GGC GAG CGG CTG GAC CTG 1632 
He Asp Arg Leu Ala Lys Arg Gin Arg Pro Gly Glu Arg Leu Asp Leu 
530 535 540 

GCT ATG CTG GCT GCG ATT CGC CGC GTT TAC GGG CTA CTT GCC AAT ACG 1680 
Ala Met Leu Ala Ala He Arg Arg Val Tyr Gly Leu Leu Ala Asn Thr 
545 550 555 560 

GTG CGG TAT CTG CAG GGC TGC GGG TCG TGG CGG GAG GAC TGG GGA CAG 1728 
Val Arg Tyr Leu Gin Gly Cys Gly Ser Trp Arg Glu Asp Trp Gly Gin 
565 570 575 

CTT TCG GGG ACG GCC GTG CCG CCC CAG GGT GCC GAG CCC CAG AGC AAC 1776 
Leu Ser Gly Thr Ala Val Pro Pro Gin Gly Ala Glu Pro Gin Ser Asn 
580 585 590 

GCG GGC CCA CGA CCC CAT ATC GGG GAC ACG TTA TTT ACC CTG TTT CGG 1824 
Ala Gly Pro Arg Pro His He Gly Asp Thr Leu Phe Thr Leu Phe Arg 
595 600 605 

GCC CCC GAG TTG CTG GCC CCC AAC GGC GAC CTG TAT AAC GTG TTT GCC 1872 
Ala Pro Glu Leu Leu Ala Pro Asn Gly Asp Leu Tyr Asn Val Phe Ala 
610 615 620 

TGG GCC TTG GAC GTC TTG GCC AAA CGC CTC CGT TCC ATG CAC GTC TTT 1920 
Trp Ala Leu Asp Val Leu Ala Lys Arg Leu Arg Ser Met His Val Phe 
625 630 635 640 

ATC CTG GAT TAC GAC CAA TCG CCC GCC GGC TGC CGG GAC GCC CTG CTG 1968 
He Leu Asp Tyr Asp Gin Ser Pro Ala Gly Cys Arg Asp Ala Leu Leu 
645 650 655 

CAA CTT ACC TCC GGG ATG GTC CAG ACC CAC GTC ACC ACC CCC GGC TCC 2016 
Gin Leu Thr Ser Gly Met Val Gin Thr His Val Thr Thr Pro Gly Ser 
660 665 670 

ATA CCG ACG ATA TGC GAC CTG GCG CGC ACG TTT GCC CGG GAG ATG GGG 2064 
He Pro Thr He Cys Asp Leu Ala Arg Thr Phe Ala Arg Glu Met Gly 
675 680 685 
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GAG GCT AAC TGA 2076 
Glu Ala Asn . 
690 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 691 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu He 
15 10 15 

Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gin Leu Ser Glu Gly Glu 
20 25 30 

Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu 
35 40 45 

Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr 
50 55 60 

Arg His Phe Ala Ser Ala Ala Leu Pro He Pro Glu Val Leu Asp He 
65 70 75 80 

Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys He Ser Arg Arg Ala Gin 
85 90 95 

Gly Val Thr Leu Gin Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu 
100 105 HO 

Gin Pro Val Ala Glu Ala Met Asp Ala He Ala Ala Ala Asp Leu Ser 
115 120 125 

Gin Thr Ser Gly Phe Gly Pro Phe Gly Pro Gin Gly He Gly Gin Tyr 
130 ~ 135 140 

Thr Thr Trp Arg Asp Phe He Cys Ala He Ala Asp Pro His Val Tyr 
145 150 ■ 155 160 

His Trp Gin Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gin 
165 170 175 

Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg 
180 185 190 

His Leu Val His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn 
195 200 205 

Gly Arg He Thr Ala Val lie Asp Trp Ser Glu Ala Met Phe Gly Asp 
210 215 220 

Ser Gin Tyr Glu Val Ala Asn He Phe Phe Trp Arg Pro Trp Leu Ala 
225 . 230 235 240 
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Cys Met Glu Gin Gin Thr Arg Tyr Phe Glu Arg Arg His Pro Glu Leu 
245 250 255 

Ala Gly Ser Pro Arg Leu Arg Ala Tyr Met Leu Arg lie Gly Leu Asp 
260 265 270 

Gin Leu Tyr Gin Ser Leu Val Asp Gly Asn Phe Asp Asp Ala Ala Trp 
275 280 285 

Ala Gin Gly Arg Cys Asp Ala lie Val Arg Ser Gly Ala Gly Thr Val 
290 295 300 

Gly Arg Thr Gin lie Ala Arg Arg Ser Ala Ala Val Trp Thr Asp Gly 
305 310 315 * 320 

Cys Val Glu Val Ala Ser Ala Phe Asp Gin Ala Ala Arg Ser Arg Gly 
325 330 ~ 335 

His Ser Asn Arg Arg Thr Ala Leu Arg Pro Arg Arg Gin Gin Glu Ala 
340 345 ^ 350 

Thr Glu Val Arg Pro Glu Gin Lys Met Pro Thr Leu Leu Arg Val Tyr 
355 360 365 

lie Asp Gly Pro His Gly Met Gly Lys Thr Thr Thr Thr Gin Leu Leu 
370 375 380 

Val Ala Leu Gly Ser Arg Asp Asp lie Val Tyr Val Pro Glu Pro Met 
385 390 395 400 

Thr Tyr Trp Arg Val Leu Gly Ala Ser Glu Thr lie Ala Asn He Tyr 
405 410 415 

Thr Thr Gin His Arg Leu Asp Gin Gly Glu lie Ser Ala Gly Asp Ala 
420 425 430 

Ala Val Val Met Thr Ser Ala Gin He Thr Met Gly Met Pro Tyr Ala 
435 440 445 

Val Thr Asp Ala Val Leu Ala Pro His lie Gly Gly Glu Ala Gly Ser 
450 455 460 

Ser His Ala Pro Pro Pro Ala Leu Thr Leu He Phe Asp Arg His Pro 
465 470 475 480 

He Ala Ala Leu Leu Cys Tyr Pro Ala Ala Arg Tyr Leu Met Gly Ser 
485 490 495 

Met Thr Pro Gin Ala Val Leu Ala Phe Val Ala Leu He Pro Pro Thr 
500 505 510 

Leu Pro Gly Thr Asn He Val Leu Gly Ala Leu Pro Glu Asp Arg His 
515 520 525 

He Asp Arg Leu Ala Lys Arg Gin Arg Pro Gly Glu Arg Leu Asp Leu 
530 535 540 

Ala Met Leu Ala Ala He Arg Arg Val Tyr Gly Leu Leu Ala Asn Thr 
545 550 555 560 

Val Arg Tyr Leu Gin Cys Gly Gly Ser Trp Arg Glu Asp Trp Gly Gin 
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565 



570 



575 



Leu Ser Gly Thr Ala Val Pro Pro Gin Gly Ala Glu Pro Gin Ser Asn 
580 585 590 

Ala Gly Pro Arg Pro His He Gly Asp Thr Leu Phe Thr Leu Phe Arg 

605 



595 



600 



Ala Pro Glu Leu Leu Ala Pro Asn Gly Asp Leu Tyr Asn Val Phe Ala 

615 620 



610 



Trp Ala Leu Asp Val Leu Ala Lys Arg Leu Arg Ser Met His Val Phe 



625 



630 



lie Leu Asp Tyr Asp Gin Ser Pro Ala Gly Cys Arg Asp Ala Leu Leu 
645 650 655 

Gin Leu Thr Ser Gly Met Val Gin Thr His Val Thr Thr Pro Gly Ser 
660 665 670 

lie Pro Thr He Cys Asp Leu Ala Arg Thr Phe Ala Arg Glu Met Gly 
675 680 685 



Glu Ala Asn 
690 
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CLAIMS 

1. A selectable fusion gene comprising a dominant positive selectable gene 
fused to and in reading frame with a negative selectable gene, wherein the selectable 
fusion gene encodes a single bifunctional fusion protein which is capable of conferring 
a dominant positive selectable phenotype and a negative selectable phenotype on a 
cellular host 

2. A selectable fusion gene according to claim 1, wherein the dominant 
positive selectable gene is selected from the group consisting of hph, neo, gpt and the 
negative selectable gene is selected from the group consisting of HSV-I TK, HPRT, 
APRTandg/tf. 

3. A selectable fusion gene according to claim 1, wherein the dominant 
positive selectable marker is hph and the negative selectable marker is HSV-I TK. 

4. A selectable fusion gene according to claim 3 encoding the sequence of 
amino acids 1-691 of SEQ ID NO:2. 

5. A selectable fusion gene according to claim 4 comprising the sequence of 
nucleotides 1-2073 of SEQ ID NO: 1. 

6. A recombinant expression vector comprising a selectable fusion gene 
according to any one of claims 1 through 5. 

7. A recombinant expression vector according to claim 6, wherein the 
vector is a retrovirus. 



8. A cell transduced with a recombinant expression vector according to claim 

6. 



9. A method for conferring a dominant positive and negative selectable 
phenotype on a cell, comprising the step of transducing the cell with a recombinant 
expression vector according to claim 6. 

10. A method for conferring a dominant positive and negative selectable 
phenotype on a cell, comprising the step of transducing the cell with a recombinant 
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expression vector according to claim 7. 

11. A method for isolating cells having a negative selectable phenotype 
comprising the steps of: 

(a) transducing a population of cells with a recombinant expression 
vector having a dominant positive selectable gene fused to and in reading frame with a 
negative selectable gene, thereby conferring the cells with a dominant positive selectable 
phenotype and a negative selectable phenotype; 

(b) applying positive selection to select cells having a dominant positive 
selectable phenotype, thereby concomitantly selecting cells having a negative selectable 
phenotype. 
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